Writer Identification with Hybrid Edge Directional Features and Nonlinear Dimension Reduction
نویسنده
چکیده
Writer identification as an interesting pattern recognition topic has attracted many researchers in forensic and biometric applications, where the identity authentication is realized by the writing style as biometric features. The core issue in writer identification, therefore, is the extraction of unique features, by which the individualistic of such handwriting styles can be faithfully represented. Many writer identification methods based on handwritten text lines have been proposed recently. In this paper, an effective method to perform writer identification is proposed by extracting edge directional features based on the recently proposed local shape descriptor Pyramid Histogram of Oriented Gradients (PHOG) and a curve fragment feature which can be termed as Histogram of Edge Direction (HED). PHOG represents the local shape by a histogram of edge orientations computed for each image sub-region, quantized into a number of bins. HED is based on a new edge detection and edge linking algorithms which robustly extract well-localized sub-pixel edges and stably links these into curve fragments. Compared with many previously proposed feature extraction approaches in writer identification research, PHOG and HED have the advantages in the effective extraction of discriminating information from handwritten text images and a hybrid feature from the combination of them can give high identification accuracy. The identification performance can be further enhanced by applying a graph embedding based nonlinear dimension reduction. The writer identification experiment is thoroughly studied in the general pattern classification framework, with five different commonly applied classifiers compared, including linear discriminant analysis (LDA), Naive Bayes, k-Nearest Neighbor, multiple layer perceptron (MLP) and Support Vector Machine (SVM). The experiments Preprint submitted to ??? January 28, 2010 have shown that the classifiers can all give very satisfactory performance but LDA is the best. On a public benchmarking IAM database, experiments demonstrates an average accuracy 96% from LDA, which compares sharply with the latest published results on the same dataset, thus illustrating the superiority of the proposed method.
منابع مشابه
Writer Identification Using Edge-Based Directional Features
This paper evaluates the performance of edge-based directional probability distributions as features in writer identification in comparison to a number of non-angular features. It is noted that the joint probability distribution of the angle combination of two ”hinged” edge fragments outperforms all other individual features. Combining features may improve the performance. Limitations of the me...
متن کاملStatistical Pattern Recognition for Automatic Writer Identification and Verification
This chapter evaluates the performance of edge-based directional probability distributions as features in writer identification in comparison to a number of other texture-level features encoding non-angular information. We introduce here a new feature: the joint probability distribution of the angle combination of two ”hinged” edge fragments. It is noted that the ”edge-hinge” distribution outpe...
متن کاملSparse-parametric writer identification using heterogeneous feature groups
This paper evaluates the performance of edge-based directional probability distributions as features in writer identification in comparison to a number of non-angular features. It is noted that angular features outperform all other features. However, the non-angular features provide additional valuable information. Rank-combination was used to realize a sparse-parametric combination scheme base...
متن کاملOffline Language-free Writer Identification based on Speeded-up Robust Features
This article proposes offline language-free writer identification based on speeded-up robust features (SURF), goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of word region and the corresponding scales and orientations (SOs) are extr...
متن کاملCombining Multiple Features for Text-Independent Writer Identification and Verification
In recent years, we proposed a number of new and very effective features for automatic writer identification and verification. They are probability distribution functions (PDFs) extracted from the handwriting images and characterize writer individuality independently of the textual content of the written samples. In this paper, we perform an extensive analysis of feature combinations. In our fu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010